1 Introduction



Within the Standard Model (SM), exclusive dimuon decays of the $B^0$ and $B^0_s$ mesons^1
are rare as they occur only via loop diagrams and are helicity suppressed. New Physics 
models, especially those with an extended Higgs sector, can significantly enhance the 
branching fractions, although in some models the rates are lowered. 

The amplitudes contributing to the branching ratio $\mathcal{B}(B^0_q \to \mu^+\mu^-)$ (where $q = d, s$
for the $B^0$ and $B^0_s$ mesons respectively) can be expressed in terms of the scalar ($c_S$),
pseudoscalar ($c_P$) and axial vector ($c_A$) Wilson coefficients in a completely general
approach [1]. Within the SM, the contributions of $c_S$ and $c_P$ are negligible while $c_A$ is
calculated with an accuracy of a few percent [2]. The dominant contribution stems from
an electroweak penguin with a $Z^0$ decaying into two muons. However, the accuracy of
the SM prediction for $\mathcal{B}(B^0_q \to \mu^+\mu^-)$ is limited by the knowledge of the decay constants
of the $B^0_q$ mesons. This limitation can be reduced by normalizing to the well-measured
mass differences of the $B^0_q$ mesons. Using this approach [3], the SM predictions are

$$\mathcal{B}(B^0_s \to \mu^+\mu^-)_{SM} = (0.32 \pm 0.02) \times 10^{-8},$$
$$\mathcal{B}(B^0 \to \mu^+\mu^-)_{SM} = (0.010 \pm 0.001) \times 10^{-8}.$$

Many extensions to the SM predict a very different Higgs sector. For instance, within
the Minimal Supersymmetric SM (MSSM) in the large $\tan\beta$ approximation [4],
$c^{MSSM}_{S,P} \propto \tan^3\beta / M_A^2$, where $M_A$ denotes the pseudoscalar Higgs mass and $\tan\beta$ the ratio of Higgs
vacuum expectation values. The most restrictive limits on the search for $B^0_q \to \mu^+\mu^-$
have so far been achieved at the Tevatron, due to the large $b\bar{b}$ cross-section at hadron
colliders. The best limits at 95% CL published so far are obtained using 6.1 fb$^{-1}$ by
the D0 collaboration [5], $\mathcal{B}(B^0_s \to \mu^+\mu^-) < 5.1 \times 10^{-8}$, and using 2 fb$^{-1}$ by the CDF
collaboration [6], $\mathcal{B}(B^0_s \to \mu^+\mu^-) < 5.8 \times 10^{-8}$ and $\mathcal{B}(B^0 \to \mu^+\mu^-) < 1.8 \times 10^{-8}$. The
CDF collaboration has also presented preliminary results [7] with 3.7 fb$^{-1}$ that lower the
limits to $\mathcal{B}(B^0_s \to \mu^+\mu^-) < 4.3 \times 10^{-8}$ and $\mathcal{B}(B^0 \to \mu^+\mu^-) < 0.76 \times 10^{-8}$.

The LHCb experiment is well suited for such searches due to its good invariant mass 
resolution, vertex resolution, muon identification and trigger acceptance. In addition, 
LHCb has a hadronic trigger capability which provides large samples of $B^0_q \to h^+h'^-$
decays, where $h$ and $h'$ stand for a hadron (kaon or pion). These are used as control
samples in order to reduce the dependence of the results on the simulation. 

The measurements in this Letter use about 37 pb$^{-1}$ of integrated luminosity collected
by LHCb between July and October 2010 at $\sqrt{s} = 7$ TeV. Assuming the SM branching
ratio, about 0.7 (0.08) $B^0_s \to \mu^+\mu^-$ ($B^0 \to \mu^+\mu^-$) decays are expected to be reconstructed using
the $b\bar{b}$ cross-section, measured within the LHCb acceptance, of $75 \pm 14$ $\mu$b [8].

^1 In this Letter the inclusion of charge-conjugate states is implicit.
2 The LHCb detector



The LHCb detector [9] is a single-arm forward spectrometer with an angular coverage
from approximately 10 mrad to 300 (250) mrad in the bending (non-bending) plane. The 
detector consists of a vertex locator (VELO), a warm dipole magnet with a bending power 
of $\int B\,dl = 4$ Tm, a tracking system, two ring-imaging Cherenkov detectors (RICH), a
calorimeter system and a muon system. The VELO consists of a series of silicon modules, 
each providing a measure of the radial and azimuthal coordinates, with the sensitive area 
starting at 8 mm from the beam line during collisions. The tracking system comprises four 
layers of silicon sensors before the magnet and three stations equipped with silicon sensors 
in the inner part and straw tubes in the outer part after the magnet. Track momenta 
are measured with a precision between $\delta p/p = 0.35\%$ at 5 GeV/$c$ and $\delta p/p = 0.5\%$ at
100 GeV/c. The RICH system provides charged hadron identification in a momentum 
range 2-100 GeV/c. The calorimeter system consists of a preshower, a scintillating pad 
detector, an electromagnetic calorimeter and a hadronic calorimeter. It identifies high 
transverse energy ($E_T$) hadron, electron and photon candidates and provides information
for the trigger. Five muon stations composed of MWPC (except in the highest rate 
region, where triple-GEMs are used) provide fast information for the trigger and muon 
identification capability. 

LHCb has a two-level trigger system both for leptonic and purely hadronic final states. 
It exploits the finite lifetime and relatively large mass of charm and beauty hadrons to 
distinguish heavy flavour decays from the dominant light quark processes. The first trigger 
level (L0) is implemented in hardware and reduces the rate to a maximum of 1 MHz, the
read-out rate of the whole detector. The second trigger level (High Level Trigger, HLT) 
is implemented in software running on an event filter CPU farm. In the first stage of the 
software trigger (HLT1) a partial event reconstruction is performed. The second stage
(HLT2) performs a full event reconstruction to enhance the signal purity further. 

The forward geometry of LHCb allows the first level trigger to collect events containing
one or two muons with very low transverse momenta ($p_T$): more than 90% of the data
were collected with a $p_T$ threshold of 1.4 GeV/$c$ for single muon triggers and $p_T(\mu_1) > 0.48$
GeV/$c$ and $p_T(\mu_2) > 0.56$ GeV/$c$ for dimuon triggers. The $E_T$ threshold for the hadron
trigger varied in the range 2.6 to 3.6 GeV. The single muon trigger line in the HLT
requires either $p_T > 1.8$ GeV/$c$ or includes a cut on the impact parameter (IP) with
respect to the primary vertex, which allows for a lower $p_T$ requirement ($p_T > 0.8$ GeV/$c$,
IP > 0.11 mm). The dimuon trigger line requires muon pairs of opposite charge forming a
common vertex and an invariant mass $M_{\mu\mu} > 4.7$ GeV/$c^2$. A second trigger line, primarily
to select $J/\psi$ events, requires $2.97 < M_{\mu\mu} < 3.21$ GeV/$c^2$. The remaining region of the
dimuon invariant mass is also covered by trigger lines that in addition require the dimuon
secondary vertex to be well separated from the primary vertex. Other HLT trigger lines
select generic displaced vertices, providing a high efficiency for purely hadronic decays
(for instance $B^0_q \to h^+h'^-$).
3 Analysis strategy



An important feature of this analysis is to rely as much as possible on data and to restrict 
to a minimum the use of simulation. Nevertheless, some Monte Carlo (MC) simulation 
has been used, based on the PYTHIA 6.4 generator [11] and the GEANT4 package [12]
for detector simulation. The first part of the analysis is the event selection (Section 4),
which significantly reduces the size of the dataset by rejecting most of the background. 

The second part consists of the study of three normalization channels with known
branching ratios: $B^+ \to J/\psi(\mu^+\mu^-)K^+$, $B^0_s \to J/\psi(\mu^+\mu^-)\phi(K^+K^-)$ and $B^0 \to K^+\pi^-$.
Using each of these normalization channels, $\mathcal{B}(B^0_q \to \mu^+\mu^-)$ can be calculated as:

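$$\mathcal{B}(B^0_q \to \mu^+\mu^-) = \mathcal{B}_{norm} \times \frac{\epsilon^{REC}_{norm}\,\epsilon^{SEL|REC}_{norm}\,\epsilon^{TRIG|SEL}_{norm}}{\epsilon^{REC}_{sig}\,\epsilon^{SEL|REC}_{sig}\,\epsilon^{TRIG|SEL}_{sig}} \times \frac{f_{norm}}{f_{B^0_q}} \times \frac{N_{B^0_q \to \mu^+\mu^-}}{N_{norm}}
= \alpha_{B^0_q \to \mu^+\mu^-} \times N_{B^0_q \to \mu^+\mu^-}, \qquad (1)$$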

where $\alpha_{B^0_q \to \mu^+\mu^-}$ denotes the normalization factor, $f_{B^0_q}$ denotes the probability that a $b$-quark
fragments into a $B^0_q$ and $f_{norm}$ denotes the probability that a $b$-quark fragments
into the $b$-hadron relevant for the chosen normalization channel with branching fraction
$\mathcal{B}_{norm}$. The reconstruction efficiency ($\epsilon^{REC}$) includes the acceptance and particle
identification, while $\epsilon^{SEL|REC}$ denotes the selection efficiency on reconstructed events. The
trigger efficiency on selected events is denoted by $\epsilon^{TRIG|SEL}$. This normalization ensures
that knowledge of the absolute luminosity and $b\bar{b}$ production cross-section is not needed,
and that many systematic uncertainties cancel in the ratio of the efficiencies. The event
selection for these channels is specifically designed to be as close as possible to the signal
selection. The ratios of reconstruction and selection efficiencies are estimated from the
simulation, while the ratios of trigger efficiencies on selected events are determined from
data (Section 5).
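As an illustration of Eq. 1, the normalization factor can be evaluated numerically. The sketch below is not the analysis code: the function name is hypothetical, the inputs are the $B^+ \to J/\psi K^+$ values quoted in Table 1, and it assumes $f_u = f_d$, so that $f_{norm}/f_{B^0} = 1$ for this channel, together with the value of $f_d/f_s$ quoted in Section 5.3.

```python
def normalization_factor(br_norm, eff_ratio, trig_ratio, frag_ratio, n_norm):
    """alpha of Eq. 1: B_norm x (efficiency ratios) x (f_norm / f_Bq) / N_norm."""
    return br_norm * eff_ratio * trig_ratio * frag_ratio / n_norm


# Inputs for B+ -> J/psi(mu mu) K+ as quoted in Table 1.
BR_NORM = 5.98e-5    # branching ratio of the normalization channel
EFF_RATIO = 0.49     # (REC x SEL|REC) efficiency ratio, normalization / signal
TRIG_RATIO = 0.96    # TRIG|SEL efficiency ratio, normalization / signal
N_NORM = 12366       # number of normalization candidates
FD_OVER_FS = 3.71    # ratio of fragmentation fractions (Section 5.3)

# Assumption: f_u = f_d, so f_norm / f_B0 = 1 and f_norm / f_Bs = f_d / f_s.
alpha_b0 = normalization_factor(BR_NORM, EFF_RATIO, TRIG_RATIO, 1.0, N_NORM)
alpha_bs = normalization_factor(BR_NORM, EFF_RATIO, TRIG_RATIO, FD_OVER_FS, N_NORM)
print(f"alpha(B0) = {alpha_b0:.2e}, alpha(Bs) = {alpha_bs:.2e}")
```

Under these assumptions the two values come out close to the corresponding entries of Table 1.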

In the third part of the analysis (Section 6) each selected event is given a probability
to be signal or background in a two-dimensional probability space defined by the dimuon 
invariant mass and a geometrical likelihood (GL). The dimuon invariant mass and GL 
probability density functions for both signal and background are determined from data. 
This procedure ensures that even though the GL is defined using simulated events, the 
result will not be biased by discrepancies between data and simulation. 

Section 7 describes the final measurement. In order to avoid unconscious bias in the
analysis, the invariant mass region for the signal ($M_{B^0} \pm 60$ MeV/$c^2$ and $M_{B^0_s} \pm 60$ MeV/$c^2$)
was blinded until the selection criteria and analysis procedure had been defined.



4 Event selection 

The selection has been designed to reduce the data sample to a manageable
level while keeping the efficiency for the signals as high as possible and the
selection of signal and control channels as similar as possible. This last requirement
is needed to minimize the systematic uncertainty in the ratio of the selection efficiencies.
The optimal separation between signal and background is left to the likelihoods (Section 6).
The basic cuts of the selection have been defined on Monte Carlo simulation [10] and then
adapted to the data.

The data for the signal and all the normalization candidates are selected using either
an inclusive two-body or a $J/\psi$ selection. Tracks are first required to be of good quality
($\chi^2$/ndf < 5) and to be displaced with respect to the closest primary vertex ($\chi^2_{IP}$/ndf >
12.5, where $\chi^2_{IP}$ is the difference between the $\chi^2$ of the primary vertex built with and
without the considered track). To reject bad combinations before performing the vertex
fit, the two tracks are required to have a distance of closest approach of less than 0.3 mm.
The secondary vertex is required to be well fitted ($\chi^2$/ndf < 9) and must be clearly
separated from the primary in the forward direction (vertex distance significance larger
than 15). When more than one primary vertex is reconstructed, the one that gives the
minimum impact parameter significance for the candidate is chosen. The reconstructed
candidate has to point to the primary vertex ($\chi^2_{IP}$/ndf < 12.5) in the case of the inclusive
two-body selection. For all selections, the primary vertex is refitted excluding the signal
tracks before calculating the $\chi^2_{IP}$/ndf and the vertex distance significance of the candidate.
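The inclusive two-body requirements listed above can be summarized as a single predicate. This is only a sketch: the `Candidate` container and its field names are hypothetical and do not correspond to any LHCb software interface, while the thresholds are the ones quoted in the text.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Candidate:
    """Hypothetical container for the quantities used by the two-body selection."""
    track_chi2_ndf: Tuple[float, float]      # track fit quality, one value per track
    track_ip_chi2_ndf: Tuple[float, float]   # chi2_IP/ndf of each track
    doca_mm: float                           # distance of closest approach of the tracks
    vertex_chi2_ndf: float                   # secondary vertex fit quality
    vertex_dist_signif: float                # vertex distance significance
    cand_ip_chi2_ndf: float                  # chi2_IP/ndf of the two-body candidate


def passes_two_body_selection(c: Candidate) -> bool:
    """Apply the cuts quoted in the text for the inclusive two-body selection."""
    good_tracks = all(x < 5.0 for x in c.track_chi2_ndf)
    displaced_tracks = all(x > 12.5 for x in c.track_ip_chi2_ndf)
    return (
        good_tracks
        and displaced_tracks
        and c.doca_mm < 0.3
        and c.vertex_chi2_ndf < 9.0
        and c.vertex_dist_signif > 15.0
        and c.cand_ip_chi2_ndf < 12.5   # candidate points back to the primary vertex
    )


# Illustrative values only, not taken from data.
good = Candidate((1.2, 2.0), (30.0, 45.0), 0.05, 3.0, 40.0, 2.0)
bad = Candidate((1.2, 2.0), (30.0, 45.0), 0.45, 3.0, 40.0, 2.0)
print(passes_two_body_selection(good), passes_two_body_selection(bad))
```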

Tracks are defined as muons if they have at least one hit in two to four of the last four 
muon stations depending on the momentum. In the inclusive $J/\psi$ selection both tracks
must be identified as muons and have an invariant mass within 60 MeV/$c^2$ of the nominal
$J/\psi$ mass. The efficiency of the muon identification requirement has been measured using
an inclusive sample of $J/\psi$ events where one of the tracks does not use any information
from the muon chambers. The efficiency measured with data agrees with MC expectations 
as a function of momentum within 2%, and the residual differences are taken into account 
in the systematic uncertainties. 

Events passing the two-body selection are considered $B^0_q \to \mu^+\mu^-$ candidates if both
tracks pass the muon identification criteria, and their invariant mass lies within 60 MeV/$c^2$
of the nominal $B^0_q$ mass. The invariant mass of the $B^0_q \to h^+h'^-$ candidates has to be
within 600 MeV/$c^2$ of the nominal $B^0_q$ mass. As the acceptance of the tracking stations
is larger than that of the muon chambers, the selected $B^0_q \to h^+h'^-$ candidates are required to
have both tracks within the muon chamber acceptance to minimize the differences with
$B^0_q \to \mu^+\mu^-$. The total efficiencies including acceptance, reconstruction and selection
criteria on MC $B^0_q \to \mu^+\mu^-$ and $B^0_q \to h^+h'^-$ events are 5.5% and 4.5% respectively; the
main difference is due to material interactions. Assuming the SM branching ratio, 0.3
$B^0_s \to \mu^+\mu^-$ and 0.04 $B^0 \to \mu^+\mu^-$ events are expected after all selection requirements.
There are 343 (342) $B^0_q \to \mu^+\mu^-$ candidates selected from data in the $B^0_s$ ($B^0$) mass
window.

The dominant background after the $B^0_q \to \mu^+\mu^-$ selection is expected to be
$b\bar{b} \to \mu\mu X$ [10, 13]. This is confirmed by comparing the kinematical distributions of the sideband
data with a $b\bar{b} \to \mu\mu X$ MC sample. The muon misidentification probability as a function
of momentum obtained from data using $K^0_S \to \pi^+\pi^-$, $\Lambda \to p\pi^-$ and $\phi \to K^+K^-$ decays
is in good agreement with MC expectations. An estimate of the background coming from
misidentified hadrons is obtained by reweighting the hadron misidentification probability
using the momentum spectrum of the background in the invariant mass sidebands. The
single hadron average misidentification probability is measured to be $(7.1 \pm 0.5) \times 10^{-3}$ and
the double hadron misidentification probability is $(3.5 \pm 0.9) \times 10^{-5}$, where the correlation
between the momenta of the two hadrons is taken into account. About 10% of the
background is due to pairs consisting of one real muon and a hadron misidentified as a
muon, mostly from decays in flight. The contribution from double misidentified hadrons is
negligible. The number of expected $B^0_q \to h^+h'^-$ candidates misidentified as $B^0_q \to \mu^+\mu^-$
within the search window of ±60 MeV/$c^2$ around the $B^0_s$ ($B^0$) mass is less than 0.1 (0.3).

For the $B^+ \to J/\psi K^+$ and $B^0_s \to J/\psi\phi$ normalization channels some additional
cuts are required. In the former case, the kaon candidates are required to pass the same
track quality and impact parameter cuts as the muons from the $J/\psi$. For $B^0_s \to J/\psi\phi$
candidates, the $K^+K^-$ invariant mass is required to be within ±10 MeV/$c^2$ of the $\phi$
mass [14]. The $B$ vertex has to be of good quality, $\chi^2$/ndf < 25. The requirements
on $\chi^2_{IP}$/ndf and vertex separation significance for the $B$ candidate are the same as those
for the signal selection. The total efficiencies including acceptance, reconstruction and
selection criteria for MC $B^+ \to J/\psi K^+$ and $B^0_s \to J/\psi\phi$ events are 2.6% and 1.3%
respectively.



5 Evaluation of the normalization factor 

The branching fractions of the three normalization channels, $B^+ \to J/\psi(\mu^+\mu^-)K^+$,
$B^0_s \to J/\psi(\mu^+\mu^-)\phi(K^+K^-)$ and $B^0 \to K^+\pi^-$, are shown in Table 1. The first two decays have
similar trigger and muon identification efficiency to the signal but a different number of
particles in the final state, while the third channel has the same two-body topology but
is selected with the hadronic trigger. The branching ratio of the $B^0_s \to J/\psi\phi$ decay is not
known precisely (~25%), but this channel has the advantage that the normalization of $B^0_s \to \mu^+\mu^-$
with a $B^0_s$ decay does not require the knowledge of the ratio of fragmentation fractions,
which has an uncertainty of ~13% [15].



5.1 Ratio of reconstruction and selection efficiencies 

The accuracy of the simulation of the reconstruction efficiency $\epsilon^{REC}$ relies on the
knowledge of the detector geometrical acceptance, the material interactions and the tracking
efficiency. The uncertainty on the tracking efficiency is taken to be 4% per track [8] and
this is the dominant source of the systematic uncertainty in the ratio with the two
normalization channels involving $J/\psi$ mesons. The ratios $\epsilon^{REC}_{norm}/\epsilon^{REC}_{sig}$ predicted by the simulation
are $0.58 \pm 0.02$ ($B^+ \to J/\psi K^+$), $0.39 \pm 0.03$ ($B^0_s \to J/\psi\phi$) and $0.75 \pm 0.05$ ($B^0 \to K^+\pi^-$).

The effect of an extra particle on the ratio of $\epsilon^{REC}$ is cross-checked in data using the
decay $B^0 \to J/\psi(\mu^+\mu^-)K^{*0}(K^+\pi^-)$. Selecting $B^0$ and $B^+$ in a similar phase-space region,
the ratio of $B^0 \to J/\psi K^{*0}$ and $B^+ \to J/\psi K^+$ yields (corrected for the ratio of branching
ratios) is a good measure of the ratio of $\epsilon^{REC}$ between $B^+ \to J/\psi K^+$ and $B^0_q \to \mu^+\mu^-$, as
shown in Ref. [10]. The measurement from data is $0.59 \pm 0.04$, in good agreement with
the estimate from MC simulation ($0.58 \pm 0.02$).

The accuracy of the simulation of $\epsilon^{SEL|REC}$ relies on how well the MC describes the
variables entering the selection. Of these only the IP distributions show a significant
discrepancy: the data are measured to have ~10% worse resolution than the simulation.
Smearing the track parameters in MC to reproduce the IP distribution in data
changes the selection efficiencies by 5-7% depending on the channel. However, the ratios
$\epsilon^{SEL|REC}_{norm}/\epsilon^{SEL|REC}_{sig}$ remain unchanged within the MC statistical uncertainty. The
ratios predicted by the MC are $0.85 \pm 0.01$ ($B^+ \to J/\psi K^+$), $0.63 \pm 0.01$ ($B^0_s \to J/\psi\phi$)
and $1.09 \pm 0.01$ ($B^0 \to K^+\pi^-$), where the uncertainties correspond to the MC statistical
uncertainties. The largest contribution to the difference in the selection efficiencies for
$B^+ \to J/\psi K^+$ and $B^0_s \to J/\psi\phi$ compared to the signal comes from the additional $\chi^2_{IP}$
requirements on the extra tracks in the normalization channels. For the $B^0 \to K^+\pi^-$
normalization channel, the selection efficiency is higher than for the signal as the tight
(±60 MeV/$c^2$) mass window is not applied to the $B^0_q \to h^+h'^-$ channel. The ratios of
efficiencies including acceptance, reconstruction and selection between normalization and
signal decays are shown in Table 1.
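As a quick consistency check, the combined ratios listed in Table 1 can be compared with the product of the reconstruction and selection ratios quoted in this section. The sketch below only multiplies the central values; the channel labels are plain strings chosen for illustration.

```python
# Central values of the efficiency ratios (normalization / signal) quoted in the text.
REC_RATIO = {"B+ -> J/psi K+": 0.58, "Bs -> J/psi phi": 0.39, "B0 -> K+ pi-": 0.75}
SEL_RATIO = {"B+ -> J/psi K+": 0.85, "Bs -> J/psi phi": 0.63, "B0 -> K+ pi-": 1.09}

# Combined (REC x SEL|REC) ratio for each normalization channel.
combined = {channel: REC_RATIO[channel] * SEL_RATIO[channel] for channel in REC_RATIO}

for channel, value in combined.items():
    print(f"{channel}: {value:.2f}")
```

The products agree with the Table 1 entries (0.49, 0.25 and 0.82) after rounding to two decimals.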



5.2 Ratio of trigger efficiencies 
The trigger efficiency $\epsilon^{TRIG|SEL}$ can be estimated from data as described in Ref.
Events that would have triggered even without the presence of the decay products of the
signal under study are tagged as TIS events (Trigger Independent of Signal). TIS events
are mostly triggered by the decay products of the other $b$ which can be in the acceptance
given the forward geometry of LHCb.

If the presence of the signal under study alone is sufficient to trigger, events are
tagged as TOS (Trigger On Signal). An event can also be TIS and TOS simultaneously
(TIS&TOS). The overall trigger efficiency on selected events can then be expressed as:

$$\epsilon^{TRIG|SEL} = \frac{N^{TRIG}}{N^{SEL}} = \frac{N^{TIS}}{N^{SEL}} \frac{N^{TRIG}}{N^{TIS}} = \epsilon^{TIS} \frac{N^{TRIG}}{N^{TIS}}, \qquad (2)$$

where $N^{SEL}$ is not directly observable as it corresponds to a sample of selected events for
a fully efficient trigger. The TIS efficiency ($\epsilon^{TIS}$) can however be measured directly on
data using the ratio $N^{TIS\&TOS}/N^{TOS}$. Therefore $\epsilon^{TRIG|SEL}$ can be expressed in terms of
fully observable quantities.
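The TIS/TOS bookkeeping of Eq. 2 can be sketched in a few lines. The function name and the event counts below are purely illustrative and are not taken from the data; the sketch assumes, as in the text, that the TIS efficiency is the same for TOS events as for all selected events.

```python
def trigger_efficiency(n_trig, n_tis, n_tos, n_tis_and_tos):
    """Eq. 2: eps(TRIG|SEL) = eps(TIS) * N(TRIG) / N(TIS), with observable inputs only."""
    eps_tis = n_tis_and_tos / n_tos   # TIS efficiency measured on the TOS sample
    return eps_tis * n_trig / n_tis


# Illustrative counts (hypothetical, not from the analysis).
eff = trigger_efficiency(n_trig=1000, n_tis=100, n_tos=950, n_tis_and_tos=76)
print(f"eps(TRIG|SEL) = {eff:.3f}")
```

With these counts the TIS efficiency is 8%, which corresponds to an implied selected sample of 1250 events for a fully efficient trigger.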

The trigger efficiency for selecting $B^+ \to J/\psi K^+$ and $B^0_s \to J/\psi\phi$ is obtained from a
large inclusive sample of $J/\psi$ events using Eq. 2. The result is $\epsilon^{TRIG|SEL}_{J/\psi} = (85.9 \pm 0.9_{stat} \pm
2.0_{syst})\%$, where the systematic uncertainty reflects the approximation of the method as
seen in the simulation. This efficiency is parameterized as a function of the largest $p_T$
and the largest IP of the two muons. Using the phase space of the $B^0_q \to \mu^+\mu^-$ decay
in these two variables, the trigger efficiency for the signal is evaluated to be
$\epsilon^{TRIG|SEL}_{B^0_q \to \mu^+\mu^-} = (89.9 \pm 0.8_{stat} \pm 4.0_{syst})\%$, where the systematic uncertainty is increased to account for the
limitations of using only two variables (the largest $p_T$ and IP of the muons in the final
state) to parameterize the trigger response.
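The ratio of these two trigger efficiencies enters the normalization. A minimal sketch of the propagation is given below; it assumes, for illustration only, that statistical and systematic uncertainties add in quadrature and that the two efficiencies are uncorrelated, which is not stated in the text.

```python
import math


def ratio_with_uncertainty(a, sigma_a, b, sigma_b):
    """Return a/b and its uncertainty for uncorrelated a and b."""
    ratio = a / b
    return ratio, ratio * math.hypot(sigma_a / a, sigma_b / b)


# Trigger efficiencies in percent, as quoted in the text.
eps_norm, sigma_norm = 85.9, math.hypot(0.9, 2.0)   # J/psi channels
eps_sig, sigma_sig = 89.9, math.hypot(0.8, 4.0)     # signal

trig_ratio, trig_ratio_sigma = ratio_with_uncertainty(eps_norm, sigma_norm, eps_sig, sigma_sig)
print(f"ratio = {trig_ratio:.2f} +- {trig_ratio_sigma:.2f}")
```

Under these assumptions the result is compatible with the value of $0.96 \pm 0.05$ listed in Table 1 for the two $J/\psi$ channels.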
Table 1: Summary of the factors and their uncertainties needed to calculate the normalization
factors ($\alpha_{B^0_q \to \mu^+\mu^-}$) for the three normalization channels considered. The branching
ratios are taken from Refs. [14, 16]. The trigger efficiency and number of $B^0 \to K^+\pi^-$
candidates correspond to only TIS events, as described in the text.

Columns: (1) normalization channel; (2) $\mathcal{B}$ ($\times 10^{-5}$); (3) $\epsilon^{REC}_{norm}\epsilon^{SEL|REC}_{norm} / \epsilon^{REC}_{sig}\epsilon^{SEL|REC}_{sig}$;
(4) $\epsilon^{TRIG|SEL}_{norm} / \epsilon^{TRIG|SEL}_{sig}$; (5) $N_{norm}$; (6) $\alpha_{B^0_s \to \mu^+\mu^-}$ ($\times 10^{-9}$); (7) $\alpha_{B^0 \to \mu^+\mu^-}$ ($\times 10^{-9}$).

(1)                                          (2)            (3)            (4)              (5)            (6)           (7)
$B^+ \to J/\psi(\mu^+\mu^-)K^+$              5.98 ± 0.22    0.49 ± 0.02    0.96 ± 0.05      12,366 ± 403   8.4 ± 1.3     2.27 ± 0.18
$B^0_s \to J/\psi(\mu^+\mu^-)\phi(K^+K^-)$   3.4 ± 0.9      0.25 ± 0.02    0.96 ± 0.05      760 ± 71       10.5 ± 2.9    2.83 ± 0.86
$B^0 \to K^+\pi^-$                           1.94 ± 0.06    0.82 ± 0.06    0.072 ± 0.010    578 ± 74       7.3 ± 1.8     1.99 ± 0.40



In the case of the $B^0 \to K^+\pi^-$ normalization channel, the trigger efficiency is
computed using the same events that are used for the normalization in Eq. 1. Therefore,
combining Eqs. 1 and 2 results in an expression equivalent to a normalization which uses
only TIS events. The total number of these events after the first trigger steps (L0 and
HLT1) is 578, accepting all HLT2 triggers, which does not allow for a precise
measurement of $\epsilon^{TIS}$. Instead, this efficiency can be measured using another control channel,
$B^+ \to J/\psi K^+$, with the result: $\epsilon^{TIS}$(L0 × HLT1) = $(6.9 \pm 0.6)\%$. The small correction
due to the HLT2 trigger inefficiency on selected $B^0 \to K^+\pi^-$ candidates is taken from
the trigger emulation. The ratios $\epsilon^{TRIG|SEL}_{norm}/\epsilon^{TRIG|SEL}_{sig}$ for the three normalization channels
are given in Table 1.

5.3 Overall normalization factor 

The yields needed to evaluate the normalization factor for the two channels containing a
$J/\psi$ in the final state are obtained from a Gaussian fit to the invariant mass distribution.
The number of candidates can be seen in Table 1, where the uncertainty is dominated by
the differences observed using different fitting models. In the case of the $B^0 \to K^+\pi^-$
decay, the RICH particle identification and mass information are used to extract the
fraction of $K^+\pi^-$ events from the selected inclusive $B^0_q \to h^+h'^-$ sample. The efficiency
of the kaon and pion identification requirements is not needed since their ratio is extracted
from the known ratio of $B^0 \to \pi^+\pi^-$ and $B^0 \to K^+\pi^-$ branching ratios as described in
Ref. [10]. The number of TIS $B^0 \to K^+\pi^-$ events observed is shown in Table 1.

As can be seen in Table 1, the normalization factors calculated using the three
complementary channels give compatible results. The final normalization factor is a weighted
average which takes into account all the sources of correlations, in particular the dominant
one coming from the uncertainty on $f_d/f_s = 3.71 \pm 0.47$, with the result:

$$\alpha_{B^0_s \to \mu^+\mu^-} = (8.6 \pm 1.1) \times 10^{-9},$$
$$\alpha_{B^0 \to \mu^+\mu^-} = (2.24 \pm 0.16) \times 10^{-9}.$$
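For orientation, the combination of the three $\alpha_{B^0 \to \mu^+\mu^-}$ values of Table 1 can be approximated by a simple inverse-variance weighted average. This sketch deliberately ignores the correlations that the analysis takes into account, so it is a simplification and not the procedure used for the quoted result; for $\alpha_{B^0_s \to \mu^+\mu^-}$ the correlated $f_d/f_s$ uncertainty makes such a naive average less appropriate.

```python
import math


def weighted_average(values, sigmas):
    """Inverse-variance weighted mean and its uncertainty (uncorrelated inputs assumed)."""
    weights = [1.0 / s ** 2 for s in sigmas]
    total = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / total
    return mean, 1.0 / math.sqrt(total)


# alpha(B0 -> mu mu) in units of 1e-9, one entry per normalization channel (Table 1).
alpha_b0_values = [2.27, 2.83, 1.99]
alpha_b0_sigmas = [0.18, 0.86, 0.40]

alpha_b0_avg, alpha_b0_err = weighted_average(alpha_b0_values, alpha_b0_sigmas)
print(f"alpha(B0) = ({alpha_b0_avg:.2f} +- {alpha_b0_err:.2f}) x 1e-9")
```

Even without correlations, the naive average lands close to the quoted $(2.24 \pm 0.16) \times 10^{-9}$.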

6 Signal and background likelihoods 

After the selection described in Section 4 the signal purity assuming the SM branching
ratio is still about $10^{-3}$ for $B^0_s \to \mu^+\mu^-$ and $10^{-4}$ for $B^0 \to \mu^+\mu^-$. Further
discrimination is achieved through the combination of two independent variables: the multivariate
analysis discriminant likelihood, GL, combining information that is largely based on the
topology of the event, and the invariant mass. The GL is defined using the statistical
method described in Refs. [13, 17]. The GL is defined to have a flat distribution between
zero and one for signal candidates, and to cluster around zero for background candidates.
The geometrical variables included in the definition of the GL are intended to be a
complete set describing the properties of the decay, and the transverse momentum of the $B$
candidate is also included, which is uncorrelated with the invariant mass. The variables
used in the definition of the GL are:

• Lifetime of the B candidate. This variable is computed using the distance between 
the secondary vertex and primary vertex, and the reconstructed momentum of the 
B candidate. When more than one primary vertex is reconstructed, the one that 
gives the minimum B impact parameter significance is chosen. 

• Muon impact parameter. This is the lowest impact parameter of the two muon
candidates with respect to any primary vertex reconstructed in the event. 

• Impact parameter of the B candidate. 

• Distance of closest approach between the two muon candidates. 

• Isolation. For each of the muon candidates, a search is performed for other tracks
that can make a good vertex with the muon candidate, as in Ref. [10]. The number
of compatible tracks is used as the discriminant variable.

• Transverse momentum of the B candidate. 

The analysis is performed in two-dimensional bins of invariant mass and GL. The invariant
mass in the signal regions (±60 MeV/$c^2$ around the $B^0$ and the $B^0_s$ masses) is divided into
six bins of equal width, and the GL into four bins of equal width distributed between zero
and one. A probability to be signal or background is assigned to events falling in each
bin.
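The binning just described can be sketched as a small helper that maps a candidate to its (mass, GL) bin. The function name and its signature are hypothetical; the window half-width and the numbers of bins are the ones given in the text.

```python
def bin_indices(mass, gl, window_center, half_width=60.0, n_mass_bins=6, n_gl_bins=4):
    """Return (mass_bin, gl_bin) for a candidate, or None outside the search window.

    mass, window_center and half_width are in MeV/c^2; gl lies in [0, 1].
    """
    offset = (mass - window_center) + half_width   # distance from the lower window edge
    if not (0.0 <= offset < 2.0 * half_width) or not (0.0 <= gl <= 1.0):
        return None
    mass_bin = int(offset / (2.0 * half_width / n_mass_bins))
    gl_bin = min(int(gl * n_gl_bins), n_gl_bins - 1)   # gl == 1.0 goes in the last bin
    return mass_bin, gl_bin


# Example with a window centred on 5363.1 MeV/c^2 (illustrative candidates).
print(bin_indices(5363.1, 0.3, 5363.1))
print(bin_indices(5500.0, 0.3, 5363.1))
```

Six mass bins over a 120 MeV/$c^2$ window give a bin width of 20 MeV/$c^2$, and four GL bins give a width of 0.25.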
6.1 Signal geometrical likelihood



Although the GL variable described above was defined using MC events, the probability
that a signal event has a given value of GL is obtained from data using inclusive $B^0_q \to
h^+h'^-$ events. Studies with large samples of MC events show that after reconstruction
and selection the GL distributions obtained from $B^0_q \to \mu^+\mu^-$ and $B^0_q \to h^+h'^-$ signal
events agree within uncertainties (3%). On the other hand, the two distributions are
different after the trigger is emulated. This bias can be removed if only TIS $B^0_q \to h^+h'^-$
events are used in the evaluation of the GL distribution. However, the total number
of TIS $B^0_q \to h^+h'^-$ events after all trigger steps (L0, HLT1 and HLT2) is 152, which is
insufficient. Instead, for the $B^0_q \to h^+h'^-$ events, the first two trigger steps (L0 and HLT1)
are required to be TIS while at the HLT2 step any of the HLT2 triggers are accepted.
This yields 955 events. The GL distribution obtained using these events is corrected for
the small bias (< 5%) introduced at the HLT2 stage using the trigger emulation. Detailed
checks with a large sample of $D^0 \to K^-\pi^+$ decays have validated this procedure.

The number of TIS $B^0_q \to h^+h'^-$ events in each GL bin is obtained from a fit to
the inclusive mass distribution [18] assigning the muon mass to the two particles. The
measured fractions in each GL bin can be seen in Fig. 1 and are quoted in Table 2. The
systematic uncertainties are included, estimated by comparing the results from the
inclusive $B^0_q \to h^+h'^-$ fit model with those obtained using a double Crystal Ball function [19]
and a simple background subtraction. The measured GL distribution obtained from TIS
$B^0_q \to h^+h'^-$ events is compatible with a flat distribution, as expected if the simulation
reproduces the data correctly.



6.2 Signal invariant mass likelihood 

The signal mass lineshape is parameterized using a Crystal Ball function [19]. Two
methods have been used to estimate the $B^0_q \to \mu^+\mu^-$ mass resolution from data. The first
of these methods uses an interpolation between the measured resolutions for $c\bar{c}$ resonances
($J/\psi$, $\psi(2S)$) and $b\bar{b}$ resonances ($\Upsilon(1S)$, $\Upsilon(2S)$, $\Upsilon(3S)$) decaying into two muons. It has
been observed that over this mass range the dimuon invariant mass resolution depends
linearly on the invariant mass of the muon pair to good approximation. Events selected in
the mass ranges around the $c\bar{c}$ and $b\bar{b}$ resonances were weighted such that the momentum
spectra of these resonances reproduce the expected momentum spectrum of the $b$ hadron
in the decay $B^0_q \to \mu^+\mu^-$. The mass resolutions of the $c\bar{c}$ and $b\bar{b}$ resonances were then
determined by fitting a Crystal Ball ($J/\psi$, $\Upsilon(1S)$) or a Gaussian ($\psi(2S)$, $\Upsilon(2S)$ and $\Upsilon(3S)$)
over exponential backgrounds.
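For reference, the standard Crystal Ball shape (a Gaussian core joined continuously to a power-law tail on the low-mass side) can be written as follows. This is a generic, unnormalized sketch of the textbook function rather than the fit code used in the analysis, and the tail parameters `alpha` and `n` in the example are arbitrary illustrative values.

```python
import math


def crystal_ball(x, mean, sigma, alpha, n):
    """Unnormalized Crystal Ball shape with a power-law tail below the peak."""
    t = (x - mean) / sigma
    a = abs(alpha)
    if t > -a:
        return math.exp(-0.5 * t * t)          # Gaussian core
    # Tail coefficients chosen so that the function and its slope are continuous at t = -a.
    big_a = (n / a) ** n * math.exp(-0.5 * a * a)
    big_b = n / a - a
    return big_a * (big_b - t) ** (-n)


# Illustrative evaluation: mean and sigma in MeV/c^2, alpha and n are hypothetical.
MEAN, SIGMA, ALPHA, N = 5363.1, 26.7, 2.0, 1.0
peak = crystal_ball(MEAN, MEAN, SIGMA, ALPHA, N)
tail = crystal_ball(MEAN - 5.0 * SIGMA, MEAN, SIGMA, ALPHA, N)
print(peak, tail)
```

The power-law tail models radiative energy loss, which is why the value five standard deviations below the peak is much larger than for a pure Gaussian.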

The mass resolution is defined as the $\sigma$ of the Crystal Ball when there is sufficient data
to perform a fit with the Crystal Ball function ($J/\psi$ and $\Upsilon(1S)$). Otherwise a Gaussian
fit is made ($\psi(2S)$, $\Upsilon(2S)$, $\Upsilon(3S)$) and the $\sigma$ of the Gaussian is used as an estimator of
the $\sigma$ of the Crystal Ball. For the Crystal Ball function, the parameters describing the
radiative tail are in good agreement between data and the Monte Carlo simulation. No
systematic shift in the resolution has been found by using a Crystal Ball or a Gaussian
above the transition point.

Figure 1: Probability of signal events in bins of GL obtained from the inclusive sample
of TIS $B^0_q \to h^+h'^-$ events (solid squares). The background probability (open circles) is
obtained from the events in the sidebands of the $\mu\mu$ invariant mass distribution in the $B^0_s$
mass window.

Interpolating linearly between the five fitted resolutions to $M_{B^0_s}$, an invariant mass
resolution of $\sigma = 26.83 \pm 0.14$ MeV/$c^2$ was estimated for $B^0_s \to \mu^+\mu^-$. The systematic
uncertainty is estimated to be 1 MeV/$c^2$, mainly due to the reweighting of the momentum
spectrum of the dimuon resonances and the variation of the resolution over the width of
the $B^0_q \to \mu^+\mu^-$ signal region.

The second method that was used to estimate the invariant mass resolution from
data is to use the inclusive $B^0_q \to h^+h'^-$ sample. The particle identification requirement
would modify the momentum and transverse momentum spectrum of pions and kaons,
and thus the mass resolution. Therefore, the fit is performed to the inclusive $B^0_q \to h^+h'^-$
sample without requiring particle identification and assigning the muon mass to the decay
products. The fit has been performed in the GL range [0.25, 1.0] and the results are shown
in Fig. 2. The fitted parameters are: the mass resolution, the $B^0$ and $B^0_s$ masses, the signal
yield, the combinatorial background yields, as well as the fraction of radiative tail and the
parameters that describe the combinatorial background. The relative contributions of $B^0$
and $B^0_s$ decays are fixed to their known values. The result of the fit for the mass resolution,
$\sigma = 25.8 \pm 1.0$ MeV/$c^2$, is consistent with the value obtained from the interpolation
method. However, by varying the assumptions made for the parameters describing the
partially reconstructed three-body $b$-hadron decays (physical background), the estimate
obtained for the resolution can change by up to 2.7 MeV/$c^2$. This is assigned as the systematic
uncertainty for this method.

Figure 2: Fit of the invariant mass distribution for $B^0_q \to h^+h'^-$ candidates in the GL
range [0.25, 1.0]. The pink dashed curve is the combinatorial background component,
while the physical background is shown with a light-blue dotted curve. The remaining
contributions are from the two-body decays of the $B^0$, $B^0_s$ and $\Lambda^0_b$.

The weighted average of the two methods, $\sigma = 26.7 \pm 0.9$ MeV/$c^2$, is taken as the
invariant mass resolution and considered to be the same for $B^0$ and $B^0_s$ decays. The mean
values of the masses obtained from the inclusive $B^0_q \to h^+h'^-$ fit are consistent with, but
not as precise as, the values obtained using the exclusive decay modes $B^0 \to K^+\pi^-$ and
$B^0_s \to K^+K^-$ isolated using the RICH particle identification: $M_{B^0} = 5275.0 \pm 1.0$ MeV/$c^2$
and $M_{B^0_s} = 5363.1 \pm 1.5$ MeV/$c^2$, which are used in the evaluation of the invariant mass
likelihood. The mean values of the masses are ~0.07% below the known values [14], which
is attributed to a small residual miscalibration of the magnetic field map. However, this
has no impact on the analysis, provided that the search windows are centred around the
measured values.
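To give a feeling for how the mass resolution translates into the six mass bins of the search window, the sketch below computes the fraction of signal in each bin. It approximates the lineshape by a pure Gaussian centred in the window, neglecting the radiative tail of the Crystal Ball function, so the numbers are indicative only; the function name is hypothetical.

```python
import math


def gaussian_bin_fractions(sigma, half_width=60.0, n_bins=6):
    """Fraction of a window-centred Gaussian falling in each equal-width mass bin."""
    def cdf(x):
        return 0.5 * (1.0 + math.erf(x / (sigma * math.sqrt(2.0))))

    width = 2.0 * half_width / n_bins
    edges = [-half_width + i * width for i in range(n_bins + 1)]
    return [cdf(hi) - cdf(lo) for lo, hi in zip(edges[:-1], edges[1:])]


# Resolution of 26.7 MeV/c^2 and a +-60 MeV/c^2 window, as quoted in the text.
fractions = gaussian_bin_fractions(26.7)
total_in_window = sum(fractions)
print([round(f, 3) for f in fractions], round(total_in_window, 3))
```

In this approximation the ±60 MeV/$c^2$ window corresponds to about ±2.2 standard deviations, so only a few percent of a Gaussian signal falls outside it.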



6.3 Background likelihood 

The mass sidebands are defined in the range between Mbo ± 600 (1200)MeV/c^ for the 
lower (upper) two GL bins, excluding the two search windows (M^o ± 60MeV/c^). The 
background in the mass sidebands is fitted with an exponential function, /(M) = Ae"^^^. 
The value of the exponential index k is fitted independently in each GL bin, in order 
to account for potentially different background compositions. The distribution of the 
invariant mass for each GL bin is shown in Fig. [3} and the predictions for the numbers of 
events in the signal regions can be seen in Tables |3] and |4j The background probability in 
the 5° mass window as a function of GL is shown in Fig. and in Table M The results 
Table 2: Probability of signal events in bins of GL obtained from the inclusive sample of
TIS $B^0_q \to h^+h'^-$ events. The background probability in the $B^0_s$ mass window is obtained
from the events in the sidebands of the dimuon invariant mass distribution.

GL bin        Signal prob.     Background prob.
0.0 - 0.25    0.360 ± 0.130    0.9… (+…/−0.0032)
0.25 - 0.5    0.239 ± 0.096    0.0… (+0.0030/−…)
0.5 - 0.75    0.176 ± 0.046    0.004… (+…/−0.0010)
0.75 - 1.0    0.225 ± 0.036    0.00024 (+0.0003…/−0.00015)




Figure 3: Distribution of the $\mu\mu$ invariant mass for different GL bins: (a) [0, 0.25], (b)
[0.25, 0.5], (c) [0.5, 0.75], (d) [0.75, 1.0]. The blue solid lines show the interpolation model
used and the dashed line shows the result of the interpolation in the search windows.



have been checked by fixing the exponential index k to be the same in all GL bins, using 
a double exponential, or using a simple linear fit in the region around the signal window. 
In all cases the predicted background is consistent with the result of the exponential fit 
with different k values, although the quality of the fit is significantly worse when k is 
forced to be the same for all bins. 
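The interpolation from the sidebands into a search window can be sketched as follows, assuming the exponential index $k$ has already been fitted and using a single search window for simplicity (the analysis excludes two). The function names and all input values are hypothetical and chosen only for illustration.

```python
import math

def exp_integral(k, lo, hi):
    """Integral of exp(-k * M) for M in [lo, hi]."""
    if k == 0.0:
        return hi - lo  # flat-background limit
    return (math.exp(-k * lo) - math.exp(-k * hi)) / k

def expected_background(n_sideband, k, sidebands, window):
    """Scale the observed sideband yield into the search window.

    n_sideband: number of events counted in the sidebands
    k: exponential index (per MeV/c^2), assumed already fitted
    sidebands: list of (lo, hi) mass ranges used in the fit
    window: (lo, hi) mass range of the search window
    """
    sideband_area = sum(exp_integral(k, lo, hi) for lo, hi in sidebands)
    return n_sideband * exp_integral(k, *window) / sideband_area

# Hypothetical inputs, with masses measured relative to the window centre
sidebands = [(-600.0, -60.0), (60.0, 600.0)]
window = (-60.0, 60.0)
print(expected_background(n_sideband=100, k=1.0e-3, sidebands=sidebands, window=window))
```

Because the normalisation $A$ cancels in the ratio of integrals, only $k$ and the sideband count enter the prediction; for $k = 0$ the estimate reduces to a simple scaling by the ratio of mass ranges.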
Figure 4: Observed distribution of selected dimuon events in the GL vs invariant mass
plane. The orange short-dashed (green long-dashed) lines indicate the ±60 MeV/$c^2$ search
window around the $B^0_s$ ($B^0$).

7 Results 

For each of the 24 bins (4 bins in GL and 6 bins in mass) the expected number of
background events is computed from the fits to the invariant mass sidebands described
in Section 6.3. The results are shown in Tables 3 and 4. The expected numbers of
signal events are computed using the normalization factors from Section 5 and the signal
likelihoods computed in Section 6.1 and Section 6.2 for a given value of $\mathcal{B}(B^0_q \to \mu^+\mu^-)$.
The expected numbers of signal events for the SM branching ratios are shown in Tables 3
and 4. The distribution of observed events in the GL vs invariant mass plane can be seen
in Fig. 4 and the observed numbers of events in each bin are given in Tables 3 and 4. The
compatibility of the observed distribution of events in the GL vs invariant mass plane with
a given branching ratio hypothesis is evaluated using the CL$_s$ method [20]. This provides
two estimators: CL$_s$ is a measure of the compatibility of the observed distribution with the
signal hypothesis, while CL$_b$ is a measure of the compatibility with the background-only
hypothesis. The observed distribution of CL$_s$ as a function of the assumed branching ratio
can be seen in Fig. 5. The expected distributions of possible values of CL$_s$ assuming the
background-only hypothesis are also shown in the same figure as a green shaded area that
covers the region of $\pm 1\sigma$ of background compatible observations. The uncertainties in the
signal and background likelihoods (Section 6) and normalization factors (Section 5) are
used to compute the uncertainties in the background and signal predictions in Tables 3
and 4. These uncertainties are the only source of systematic uncertainty and they are
included in the CL$_s$ using the techniques described in Ref. [20]. Given the specific pattern
Figure 5: (a) Observed (solid curve) and expected (dashed curve) CL$_s$ values as a function
of $\mathcal{B}(B^0_s \to \mu^+\mu^-)$. The green shaded area contains the $\pm 1\sigma$ interval of possible results
compatible with the expected value when only background is observed. The 90% (95%)
C.L. observed value is identified by the solid (dashed) line. (b) The same for $\mathcal{B}(B^0 \to \mu^+\mu^-)$.



of the observed events, the systematic uncertainty on the background prediction has a 
negligible effect on the quoted limit. The effect of the uncertainty on the signal prediction 
increases the quoted limits by less than 3%. 
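To make the CL$_s$ construction concrete, the following minimal sketch treats a single counting bin with Poisson statistics and no systematic uncertainties. This is a deliberate simplification: the analysis combines 24 bins and includes uncertainties following Ref. [20]. The function names and the example inputs are hypothetical.

```python
import math

def poisson_cdf(n, mu):
    """P(N <= n) for a Poisson distribution with mean mu."""
    term = math.exp(-mu)
    total = term
    for k in range(1, n + 1):
        term *= mu / k
        total += term
    return total

def cls(n_obs, background, signal):
    """CLs = CL(s+b) / CL(b) for a single counting bin."""
    cl_sb = poisson_cdf(n_obs, signal + background)
    cl_b = poisson_cdf(n_obs, background)
    return cl_sb / cl_b

def upper_limit(n_obs, background, alpha=0.05):
    """Signal yield at which CLs falls to alpha, found by bisection."""
    lo, hi = 0.0, 1.0
    while cls(n_obs, background, hi) > alpha:
        hi *= 2.0  # expand until the limit is bracketed
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if cls(n_obs, background, mid) > alpha:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical example: no events observed on an expected background of 0.3
print(upper_limit(n_obs=0, background=0.3, alpha=0.05))
```

In this single-bin picture a signal yield is excluded at 95% C.L. when CL$_s$ drops below 0.05; dividing by CL$_b$ prevents a downward background fluctuation from producing an artificially strong limit.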



The evaluation of CL$_b$ [20] gives a probability of about 20% for the compatibility with
the background-only hypothesis for both the $B^0_s$ and $B^0$ decays. This low value can be
attributed to the slight deficit of observed events in the most sensitive bins, as can be seen
in Tables 3 and 4. As no significant deviation from the background-only hypothesis is
observed, upper limits are computed using the CL$_s$ distributions in Fig. 5 with the results

$\mathcal{B}(B^0_s \to \mu^+\mu^-) < 4.3\ (5.6) \times 10^{-8}$ at 90% (95%) C.L.,
$\mathcal{B}(B^0 \to \mu^+\mu^-) < 1.2\ (1.5) \times 10^{-8}$ at 90% (95%) C.L.,

while the expected values of the limits are $\mathcal{B}(B^0_s \to \mu^+\mu^-) < 5.1\ (6.5) \times 10^{-8}$ and
$\mathcal{B}(B^0 \to \mu^+\mu^-) < 1.4\ (1.8) \times 10^{-8}$ at 90% (95%) C.L. The limits observed are similar to the
best published limits [5] for the decay $B^0_s \to \mu^+\mu^-$ and more restrictive for the decay
$B^0 \to \mu^+\mu^-$.
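For orientation, the observed 95% C.L. limits can be compared with the SM predictions quoted in the Introduction, $(0.32 \pm 0.02) \times 10^{-8}$ and $(0.010 \pm 0.001) \times 10^{-8}$. The short sketch below only performs that division using central values; the dictionary keys and variable names are hypothetical labels.

```python
# Observed 95% C.L. limits and SM central values, as quoted in the text
limits_95 = {"Bs -> mu mu": 5.6e-8, "B0 -> mu mu": 1.5e-8}
sm_prediction = {"Bs -> mu mu": 0.32e-8, "B0 -> mu mu": 0.010e-8}

# Ratio of each limit to the corresponding SM central value
ratio_to_sm = {mode: limits_95[mode] / sm_prediction[mode] for mode in limits_95}
for mode, ratio in ratio_to_sm.items():
    print(f"{mode}: limit / SM prediction = {ratio:.1f}")
```

The limits sit roughly a factor of 17 ($B^0_s$) and 150 ($B^0$) above the SM central values, so at this stage they mainly constrain New Physics scenarios that enhance the rates.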



8 Conclusions 

With about 37 pb$^{-1}$ of integrated luminosity, LHCb has searched for the rare decays
$B^0_s \to \mu^+\mu^-$ and $B^0 \to \mu^+\mu^-$ and reached sensitivities similar to the existing limits from
the Tevatron. This could be achieved due to the large acceptance and trigger efficiency
of LHCb, as well as the larger $b\bar{b}$ cross-section in pp collisions at $\sqrt{s} = 7$ TeV. The
observed events are compatible with the background expectations, and the upper limits
Table 3: Expected background, expected SM signal and observed number of events in bins
of GL and invariant mass, in the ±60 MeV/$c^2$ mass window around the $B^0_s$ mass central
value of 5363.1 MeV/$c^2$.
Mass bin      Quantity     GL [0, 0.25]             GL [0.25, 0.5]           GL [0.5, 0.75]           GL [0.75, 1]
(MeV/c^2)

[-40, -20]    Exp. bkg.    …                        1.… (+0.18/−0.17)        0.269 (+…/−0.062)        0.01… (+0.0195/−0.0094)
              Exp. sig.    0.0220 (+…/−0.0081)      0.0146 (+…/−0.0054)      0.0107 (+…/−0.0027)      0.0138 (+…/−…)
              Observed     55                       2                        …                        …

[-20, 0]      Exp. bkg.    55.3 (+…/−…)             1.24 (+0.1…/−0.16)       0.2… (+0.069/−0.059)     0.01… (+…/−0.0086)
              Exp. sig.    0.038 (+…/−…)            0.02… (+…/−0.010)        0.01… (+0.0063/−0.0047)  0.02… (+0.0060/−0.0044)
              Observed     73                       …                        …                        …

[0, 20]       Exp. bkg.    54.4 (+…/−…)             … (+…/−0.16)             0.2… (+0.066/−0.057)     0.01… (+0.0165/−0.0080)
              Exp. sig.    0.038 (+…/−…)            0.02… (+…/−0.010)        0.01… (+0.0063/−0.0047)  0.02… (+0.0060/−0.0044)
              Observed     60                       …                        …                        …

[20, 40]      Exp. bkg.    53.6 (+…/−…)             1.… (+…/−0.15)           0.2… (+0.063/−0.054)     0.0118 (+…/−…)
              Exp. sig.    0.0220 (+…/−0.0081)      0.0146 (+…/−0.0054)      0.0107 (+0.0036/−0.0027) 0.0138 (+…/−…)
              Observed     53                       2                        …                        …

[40, 60]      Exp. bkg.    52.8 (+…/−…)             1.14 (+0.16/−…)          0.224 (+…/−0.052)        0.0108 (+…/−…)
              Exp. sig.    0.0076 (+…/−…)           0.0050 (+…/−0.0019)      0.00… (+0.0013/−0.0010)  0.0047 (+…/−0.0010)
              Observed     55                       1                        …                        …





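are evaluated to be

$\mathcal{B}(B^0_s \to \mu^+\mu^-) < 5.6 \times 10^{-8}$ at 95% C.L.,
$\mathcal{B}(B^0 \to \mu^+\mu^-) < 1.5 \times 10^{-8}$ at 95% C.L.,

while the expected values of the limits are $\mathcal{B}(B^0_s \to \mu^+\mu^-) < 6.5 \times 10^{-8}$ and
$\mathcal{B}(B^0 \to \mu^+\mu^-) < 1.8 \times 10^{-8}$ at 95% C.L.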

The LHC is expected to deliver a much larger sample of collisions in 2011. Given
the low level of background in the most sensitive bins shown in Tables 3 and 4, LHCb
should be able to explore the interesting region of branching ratios at the $10^{-8}$ level in
Table 4: Expected background, expected SM signal and observed number of events in
bins of GL and invariant mass, in the ±60 MeV/$c^2$ mass window around the $B^0$ central
value of 5275.0 MeV/$c^2$.





Mass bin      Quantity     GL [0, 0.25]             GL [0.25, 0.5]              GL [0.5, 0.75]           GL [0.75, 1]
(MeV/c^2)

[-60, -40]    Exp. bkg.    60.8 (+…/−…)             1.48 (+…/−…)                0.345 (+…/−…)            0.024 (+…/−…)
              Exp. sig.    0.00090 (+…/−…)          0.00060 (+…/−…)             0.00044 (+…/−…)          0.00056 (+…/−…)
              Observed     59                       2                           …                        …

[-40, -20]    Exp. bkg.    59.9 (+…/−…)             1.44 (+…/−…)                …                        0.022 (+…/−…)
              Exp. sig.    0.00263 (+…/−…)          0.00174 (+…/−…)             0.00128 (+…/−…)          0.00164 (+…/−…)
              Observed     67                       …                           …                        …

[-20, 0]      Exp. bkg.    59.0 (+…/−…)             1.40 (+…/−…)                0.3… (+0.077/−0.067)     0.020 (+…/−…)
              Exp. sig.    …                        0.0030 (+…/−…)              0.00219 (+…/−…)          0.00280 (+…/−…)
              Observed     …                        …                           …                        …

[0, 20]       Exp. bkg.    58.1 (+…/−…)             1.36 (+…/−…)                0.300 (+…/−…)            0.019 (+…/−…)
              Exp. sig.    0.0045 (+…/−…)           …                           0.00219 (+…/−…)          0.00280 (+…/−…)
              Observed     60                       …                           …                        …

[20, 40]      Exp. bkg.    57.3 (+…/−…)             1.… (+0.17/−0.16)           0.287 (+…/−…)            … (+…/−0.010)
              Exp. sig.    0.00… (+…/−0.00093)      0.00174 (+0.00076/−0.00061) 0.00128 (+…/−…)          0.00164 (+0.00035/−0.00025)
              Observed     42                       2                           1                        …

[40, 60]      Exp. bkg.    56.4 (+…/−…)             … (+…/−0.16)                0.274 (+0.067/−…)        0.0158 (+…/−…)
              Exp. sig.    0.00090 (+…/−…)          0.00060 (+…/−…)             0.00044 (+…/−0.00011)    0.00056 (+…/−…)
              Observed     49                       2                           …                        …




the near future. 